Analysis on Mel Frequency Cepstral Coefficients and Linear Predictive Cepstral Coefficients as Feature Extraction on Automatic Accents Identification

Authors

  • Noraziahtulhidayu Kamarudin
  • S.A.R Al-Haddad
  • Asem Khmag
  • Abd Rauf bin Hassan
  • Shaiful Jahari Hashim
Abstract

Automatic accent identification is an important topic, particularly within speaker recognition. Techniques that have proved effective in music recognition and accent identification may help improve the recognition rate. However, techniques for music genre identification, for automatic accent identification, and for combining the two remain unclear in this field. This paper examines the main stages of speech processing and identification: the acoustic (speech) signal, pre-processing, feature extraction, pattern classification, and accuracy results. Automatic accent identification from speech signals begins with general pre-processing, followed by feature extraction; this study compares two feature extraction techniques, Mel-Frequency Cepstral Coefficients (MFCC) and Linear Predictive Cepstral Coefficients (LPCC). Vocal tract features with musical characteristics are used for musical genre identification, and three pattern classification methods are considered: Hidden Markov Model (HMM), Support Vector Machine (SVM), and Probabilistic Principal Component Analysis. The paper thus investigates feature extraction techniques for identifying accents, with a view to implementing them in Quranic accent identification, and proposes MFCC as the better feature extraction technique: it achieves 93.33% accuracy, compared with 86.67% for LPCC.
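The two feature extraction techniques compared in the abstract can be sketched roughly as follows. This is a minimal NumPy illustration, not the authors' implementation: the function names and every parameter value (0.97 pre-emphasis, 400-sample frames with a 160-sample hop, 512-point FFT, 26 mel filters, 13 coefficients, LPC order 12) are assumptions chosen because they are common defaults, and none of them come from the paper.

```python
import numpy as np

def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters whose edges are equally spaced on the mel scale.
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sample_rate).astype(int)
    bank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, centre, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, centre):
            bank[i - 1, k] = (k - left) / (centre - left)
        for k in range(centre, right):
            bank[i - 1, k] = (right - k) / (right - centre)
    return bank

def mfcc(signal, sample_rate, frame_len=400, hop=160, n_fft=512,
         n_filters=26, n_ceps=13):
    # All default values here are illustrative assumptions.
    signal = np.asarray(signal, dtype=float)
    # Pre-emphasis, framing, and Hamming windowing (the pre-processing stage).
    emphasized = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    n_frames = 1 + (len(emphasized) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = emphasized[idx] * np.hamming(frame_len)
    # Power spectrum -> mel filterbank energies -> log -> DCT-II.
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    log_energies = np.log(power @ mel_filterbank(n_filters, n_fft, sample_rate).T + 1e-10)
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_ceps)[:, None] * (2 * n[None, :] + 1) / (2 * n_filters))
    return log_energies @ basis.T

def lpc_to_cepstrum(a, n_ceps):
    # Standard recursion from predictor coefficients a_1..a_p to cepstral
    # coefficients c_1..c_n, for the model s[n] ~ sum_k a_k * s[n-k].
    p = len(a)
    c = np.zeros(n_ceps + 1)  # c[0] (gain term) is left out
    for m in range(1, n_ceps + 1):
        acc = a[m - 1] if m <= p else 0.0
        for k in range(max(1, m - p), m):
            acc += (k / m) * c[k] * a[m - k - 1]
        c[m] = acc
    return c[1:]

def lpcc(frame, order=12, n_ceps=13):
    # LPC by the autocorrelation method; fails on an all-zero (silent) frame.
    frame = np.asarray(frame, dtype=float)
    r = np.correlate(frame, frame, mode="full")[len(frame) - 1:len(frame) + order]
    lags = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    a = np.linalg.solve(r[lags], r[1:order + 1])
    return lpc_to_cepstrum(a, n_ceps)
```

With these assumed defaults, `mfcc(x, 16000)` returns one row of 13 coefficients per 400-sample frame, and `lpcc` takes a single windowed frame. Either feature matrix could then be passed to a classifier such as an HMM or SVM, which is the comparison the abstract reports.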

Similar resources

Physiologically Motivated Feature Extraction for Robust Automatic Speech Recognition

In this paper, a new method is presented to extract robust speech features in the presence of external noise. The proposed method, based on two-dimensional Gabor filters, takes into account the spectro-temporal modulation frequencies and also limits redundancy at the feature level. The performance of the proposed feature extraction method was evaluated on isolated speech words which are ext...

DWT and LPC based feature extraction methods for isolated word recognition

In this article, new feature extraction methods, which utilize wavelet decomposition and reduced order linear predictive coding (LPC) coefficients, have been proposed for speech recognition. The coefficients have been derived from the speech frames decomposed using discrete wavelet transform. LPC coefficients derived from subband decomposition (abbreviated as WLPC) of speech frame provide bette...

Artificial Neural Networks and Support Vector Machine for Voice Disorders Identification

Diagnosing voice diseases through invasive medical techniques is effective but often uncomfortable for patients; automatic speech recognition methods have therefore attracted growing interest in recent years and have had real success in the identification of voice impairments. In this context, this paper proposes a reliable algorithm for voice disorders identi...

Automatic Speaker Recognition using LPCC and MFCC

A person's voice contains various parameters that convey information such as emotion, gender, attitude, health and identity. This report talks about speaker recognition which deals with the subject of identifying a person based on their unique voiceprint present in their speech data. Pre-processing of the speech signal is performed before voice feature extraction. This process ensures the voice...

Spoken Language Identification Using Hybrid Feature Extraction Methods

This paper introduces and motivates the use of a hybrid robust feature extraction technique for a spoken language identification (LID) system. Speech recognizers use a parametric form of a signal to get the most important distinguishable features of the speech signal for the recognition task. In this paper Mel-frequency cepstral coefficients (MFCC), Perceptual linear prediction coefficients (PLP) alon...

Publication date: 2016